METHOD AND APPARATUS FOR ENCODING AND/OR DECODING DIGITAL 
DATA USING BANDWIDTH EXTENSION TECHNOLOGY 



5 BACKGROUND OF THE INVENTION 

This application claims the priority of Korean Patent Application No. 
2003-14485, filed on March 7, 2003, in the Korean Intellectual Property Office, the 
disclosure of which is incorporated herein in its entirety by reference. 

10 

1 . Field of the Invention 

The present invention relates to encoding and decoding of digital data, and 
more particularly, to a method and apparatus for encoding and decoding digital data 
using bandwidth extension technology. 

15 2. Description of the Related Art 

As digital signal processing technologies advance, audio signals are mostly 
stored and played back as digital data. Digital audio storage and/or playback 
devices sample and quantize analog audio signals, transform the analog audio 
signals into pulse code modulation (PCM) audio data, which is a digital signal, and 

20 store the PCM audio data in an information storage medium such as a compact disc 
(CD), a digital versatile disc (DVD), or the like, so that a user can play back data 
from the information storage medium when he/she desires to listen to the PCM audio 
data. Digital audio signal storage and/or reproduction methods considerably 
improve sound quality and remarkably reduce the deterioration of sound caused by 

25 long storage periods compared to analog audio signal storage and/or reproduction 
methods used on a long-play (LP) record, a magnetic tape, or the like. However, 
the large amount of digital data sometimes poses a problem for storage and 
transmission. 

In order to solve these problems, a wide variety of compression technologies 
30 for reducing the amount of digital audio data are used. Moving Picture Expert 

Group audio standards drafted by the International Standard Organization (ISO) or 
AC-2/AC-3 technologies developed by Dolby adopt a method of reducing the amount 
of data using a psychoacoustic model, which results in an effective reduction in the 
amount of data regardless of the characteristics of signals. In other words, MPEG 
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audio standards and AC-2/AC-3 technologies provide almost the same sound quality 
as a CD only at a bit rate of 64Kbps - 384Kbps, that is, at 1/6 - 1/8 that of existing 
digital encoding technologies. 

However, all these technologies comply with a method of detecting, 
5 quantizing, and encoding digital data in an optimum state at a fixed bit rate. Thus, 
when digital data is transmitted via a network, a transmission bandwidth may be 
reduced due to poor network conditions. Also, the network may be disconnected, 
such that network service is not available. Also, when digital data is transformed 
into a smaller bitstream so as to be suitable for mobile devices having a limited 

10 storage capacity, re-encoding should be performed to reduce the amount of data. 
To achieve this, a considerable amount of calculation is required. 

For this reason, the present applicant filed an application for "Bit Rate 
Scalable Audio Encoding and/or Decoding Method and Apparatus Using Bit-Sliced 
Arithmetic Coding (BSAC) Technology" as Korean Patent Application No. 97-61298 

15 on November 19, 1997 in the Korean Intellectual Property Office and has been 

granted Korean Patent Registration No. 261253 on April 17, 2002. According to 
BSAC technology, a bitstream, which has been encoded at a high bit rate, can be 
transformed into a bitstream having low bit rate. Since restoring can be achieved 
using only a portion of a bitstream, even if a network is overloaded, the performance 

20 of a decoder is poor, or a user demands a low bit rate, the user can be provided with 
service at moderate sound quality using only a portion of the bitstream (though the 
performance of the decoder may deteriorate as much as low bit rate). Nevertheless, 
at the lowered bit rate, the performance of the decoder is unavoidably degraded. 

25 SUMMARY OF THE INVENTION 

The present invention provides a digital data encoding and/or decoding 
method and apparatus capable of controlling the bit rate of digital data such that 
even though restoring is carried out using only a portion of a bitstream, high quality 
sound can be reproduced. 

30 According to an aspect of the present invention, there is provided a method of 

encoding digital data. The method includes: bandwidth-extension-encoding the 
digital data, outputting bandwidth-limited data, and generating bandwidth extension 
information; encoding the bandwidth-limited data into a hierarchical structure having 
a base layer and at least one enhancement layer so as to control a bit rate; and 
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multiplexing the encoded bandwidth-limited data and the bandwidth extension 
information. 

According to another aspect of the present invention, there is provided a 
method of encoding audio data. The method includes: 
5 bandwidth-extension-encoding the audio data, outputting bandwidth-limited audio 
data, and generating bandwidth extension information; encoding the 
bandwidth-limited audio data into a hierarchical structure having a base layer and at 
least one enhancement layer so as to control a bit rate; and multiplexing the 
encoded bandwidth-limited audio data and the bandwidth extension information. 

10 According to still another aspect of the present invention, there is provided a 

method of decoding digital data. The method includes: demultiplexing an input 
bitstream and sampling bandwidth-limited data that is encoded into a hierarchical 
structure having a base layer and at least one enhancement layer and bandwidth 
extension information; decoding at least a portion of the bandwidth-limited data 

15 corresponding to the base layer; and generating digital data in at least a portion of a 
band that is not covered by the decoded portion of the bandwidth-limited data based 
on the decoded portion of the bandwidth-limited data and with reference to the 
bandwidth extension information, and then patching the generated digital data to the 
decoded portion of the bandwidth-limited data. 

20 According to yet another aspect of the present invention, there is provided a 

method of decoding audio data. The method includes: demultiplexing an input 
audio bitstream and sampling bandwidth-limited audio data that is encoded into a 
hierarchical structure having a base layer and at least one enhancement layer and 
bandwidth extension information; decoding at least a portion of the bandwidth-limited 

25 audio data corresponding to the base layer; and generating audio data in at least a 
portion of a band that is not covered by the decoded portion of the bandwidth-limited 
audio data based on the decoded portion of the bandwidth-limited audio data and 
with reference to the bandwidth extension information, and then patching the 
generated digital data to the decoded portion of the bandwidth-limited audio data. 

30 According to yet another aspect of the present invention, there is provided an 

apparatus for encoding digital data. The apparatus includes: a bandwidth extension 
encoder that bandwidth-extension-encodes the digital data, outputs 
bandwidth-limited data, and generates bandwidth extension information; a fine grain 
scalability encoder that encodes the bandwidth-limited data into a hierarchical 
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structure having a base layer and at least one enhancement layer so as to control a 
bit rate; and a multiplexer that multiplexes the encoded bandwidth-limited data and 
the bandwidth extension information. 

According to yet another aspect of the present invention, there is provided an 
5 apparatus of encoding audio data. The apparatus includes: a bandwidth extension 
encoder that bandwidth-extension-encodes the audio data, outputs 
bandwidth-limited audio data, and generates bandwidth extension information; a fine 
grain scalability encoder that encodes the bandwidth-limited audio data into a 
hierarchical structure having a base layer and at least one enhancement layer so as 

10 to control a bit rate; and a multiplexer that multiplexes the encoded bandwidth-limited 
audio data and the bandwidth extension information. 

According to yet another aspect of the present invention, there is provided an 
apparatus for decoding digital data. The apparatus includes: a demultiplexer that 
demultiplexes an input bitstream and samples bandwidth-limited data that is 

15 encoded into a hierarchical structure having a base layer and at least one 

enhancement layer and bandwidth extension information; a fine grain scalability 
decoder that decodes at least a portion of the sampled bandwidth-limited data 
corresponding to the base layer; and a bandwidth extension decoder that generates 
digital data in at least a portion of a band that is not covered by the decoded portion 

20 of the bandwidth-limited data based on the decoded portion of the bandwidth-limited 
data and with reference to the bandwidth extension information and the patches the 
generated digital data to the decoded portion of the bandwidth-limited data. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 The above and other features and advantages of the present invention will 

become more apparent by describing in detail exemplary embodiments thereof with 
reference to the attached drawings in which: 

FIG. 1 is a block diagram of an encoding apparatus according to the present 
invention; 

30 FIG. 2 is a block diagram of an encoding apparatus according to an 

embodiment of the present invention; 

FIG. 3 illustrates an example of the realization of the encoding apparatus 
shown in FIG. 2; 
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FIG. 4 is a block diagram of a decoding apparatus according to the present 
invention; 

FIG. 5 is a block diagram of a decoding apparatus according to an 
embodiment of the present invention; 
5 FIG. 6 illustrates an example of the realization of the decoding apparatus 

shown in FIG. 5; 

FIG. 7 illustrates the structure of a bitstream output from a fine grain 
scalability (FGS) encoder 2; 

FIG. 8 illustrates the detailed structure of side information shown in FIG. 7; 
10 FIG. 9 illustrates the structure of a bitstream output from a multiplexer 3; 

FIG. 10 is a referential view for explaining bandwidth extension decoding 
performed by a bandwidth extension (BWE) decoder 9 in more detail; 

FIG. 1 1 is a flowchart for explaining an encoding method according to the 
present invention; 

15 FIG. 12 is a flowchart for explaining an encoding method according to an 

embodiment of the present invention; 

FIG. 13 is a flowchart for explaining a decoding method according to the 
present invention; and 

FIG. 14 is a flowchart for explaining a decoding method according to an 
20 embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
Hereinafter, preferred embodiments of the present invention will be described 
in detail with reference to the attached drawings. 
25 FIG. 1 is a block diagram of an encoding apparatus according to the present 

invention. Referring. to FIG. 1, the encoding apparatus, which encodes digital data 
and outputs the digital data as a bitstream, includes a bandwidth extension (BWE) 
encoder 1 , a fine grain scalability (FGS) encoder 2, and a multiplexer 3. 

The BWE encoder 1 BWE-encodes digital data, outputs bandwidth-limited 
30 digital data, and generates BWE information. BWE encoding refers to a technique 
for receiving digital data, slicing off a portion of the digital data in a high frequency 
band, and generating side information necessary for restoring the sliced portion of 
the digital data. Here, the remaining portion of the digital data is called 
"bandwidth-limited data" and the side information is called "BWE information". An 
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example of a BWE technique is a Spectral Band Replication (SBR) technology 
developed by Coding Technologies. The details of the SBR technology are 
disclosed in the "Convention Paper 5560" presented at the 1 12 th Convention of 
Audio Engineering Society held on May 10-13, 2002. 

The FGS encoder 2 encodes the bandwidth-limited digital data into a 
hierarchical structure having a base layer and at least one enhancement layer so as 
to control a bit rate. FGS encoding refers to a technique for encoding data into a 
structure having a plurality of layers so as to control a bit rate, i.e., provide FGS. 
The BSAC technology disclosed in Korean Patent Application No. 97-61298 is an 
example of FGS coding. 

The multiplexer 3 multiplexes the bandwidth-limited digital data encoded by 
the FGS encoder 2 and the BWE information generated by the BWE encoder 1. 

FIG. 2 is a block diagram of an encoding apparatus according to an 
embodiment of the present invention. Referring to FIG. 2, the encoding apparatus, 
which receives and encodes PCM audio data, and then outputs an audio bitstream, 
includes a BWE encoder 1 , a FGS encoder 2, and a multiplexer 3. Compared to 
the encoding apparatus shown in FIG. 1, the encoding apparatus shown in FIG. 2 is 
characterized by processing audio data. Blocks performing the same functions as 
those shown in FIG. 1 are denoted by the same reference numerals, and thus 
repeated descriptions will be omitted. 

The BWE encoder 1 BWE-encodes PCM audio data, outputs 
bandwidth-limited PCM audio data, and generates BWE information. The FGS 
encoder 2 encodes the bandwidth-limited PCM audio data into a hierarchical 
structure having a base layer and at least one enhancement layer so as to control a 
bit rate. In other words, the FGS encoder 2 differentially encodes side information 
corresponding to the base layer, bit-sliced-encodes a plurality of quantization 
samples corresponding to the base layer, differentially encodes side information 
corresponding to a next enhancement layer until a plurality of predetermined layers 
are completely encoded, and bit-sliced-encodes a plurality of quantization samples 
corresponding to the next enhancement layer. Here, the side information contains 
scale factor information and coding model information, and the quantization samples 
are obtained by transforming and quantizing input digital data. The side information 
and the quantization samples will be explained in detail later. The multiplexer 3 
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multiplexes the bandwidth-limited PCM audio data encoded by the FGS encoder 2 
and the BWE information generated by the BWE encoder 1 . 

FIG. 3 illustrates an example of the realization of the encoding apparatus 
shown in FIG. 2. Referring to FIG. 3, the encoding apparatus includes a BWE 
5 encoder 1 , a FGS encoder 2, and a multiplexer 3. Blocks performing the same 
functions as those shown in FIG. 2 are denoted by the same reference numerals, 
and thus repeated descriptions will be omitted. 

In particular, the FGS encoder 2 includes a transforming unit 21 , a 
psychoacoustic unit 22, and a quantizing unit 23, and a FGS encoding unit 24. The 

10 transforming unit 21 receives PCM audio data that is an audio signal in the time 
domain and transforms the PCM audio data into an audio signal in the frequency 
domain with reference to psychoacoustic model information provided by the 
psychoacoustic unit 22. The characteristics of audio signals able to be perceived 
by humans, hereinafter referred to as perceptual audio signals, are not much 

15 different in the time domain. In contrast, the characteristics of perceptual and 

unperceptual audio signals in the frequency domain are much different considering 
the psychoacoustic model. Thus, compression efficiency can be improved by 
assigning a different number of bits to each frequency band. 

The psychoacoustic unit 22 provides information on a psychoacoustic model 

20 such as attack detection information or the like to the transforming unit 21 , packs the 
audio signal transformed by the transforming unit 21 into sub-band audio signals, 
calculates a masking threshold for each of the sub-bands using a masking effect 
resulting from the interaction among the sub-band signals, and provides the masking 
threshold to the quantizing unit 23. The masking threshold indicates the maximum 

25 power of an audio signal that human cannot perceive due to the interaction between 
audio signals. In the present embodiment, the psychoacoustic unit 22 calculates a 
masking threshold and the like for stereo components using Binaural Masking Level 
Depression (BMLD). 

The quantizing unit 23 scalar-quantizes each of the sub-band audio signals 

30 based on corresponding scale factor information to reduce quantization noise power 
in each of the sub-bands to be less than the masking threshold provided by the 
psychoacoustic unit 22 and then outputs quantization samples, so that a human can 
hear the sub-band audio signals but not perceive the quantization noise therein. In 
other words, the quantizing unit 23 quantizes the sub-band audio signals in such a 
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way that a noise-to-mask ratio (NMR), indicating a ratio of noise generated in each 
sub-band to the masking threshold calculated by the psychoacoustic unit 22, in 
full-bandwidth is OdB or less. An NMR of OdB or less indicates that a human cannot 
hear quantization noise. 

The FGS encoding unit 24 encodes quantization samples and side 
information belonging to each layer into a hierarchical structure. The side 
information contains scale band information, coding band information, scale factor 
information, and coding model information corresponding to each layer. The scale 
band information and the coding band information may be packed as header 
information and then transmitted to a decoding apparatus. Alternatively, the scale 
band information and the coding band information may be encoded and packed as 
side information corresponding to each layer and then transmitted to the decoding 
apparatus. Also, since scale band information and coding band information is 
already stored in the decoding apparatus, the scale band information and the coding 
band information may not be transmitted to the decoding apparatus. 

In more detail, the FGS encoding unit 24 encodes side information containing 
scale factor information and coding model information corresponding to a first layer 
while bit-sliced-encoding quantization samples corresponding to the first layer with 
reference to the coding model information. The bit-sliced-encoding indicates coding 
used in the above-described BSAC and sequentially lossless-encodes most 
significant bits, next significant bits, and least significant bits. A second layer 
undergoes the same process as the first layer. In other words, a plurality of 
predetermined layers are sequentially encoded layer by layer until they are 
completely encoded. The first layer is named a base layer and the remaining layers 
are named enhancement layers. A more detailed description of the hierarchical 
structure will be provided later. 

The scale band information is necessary for properly performing quantization 
depending on the frequency characteristics of an audio signal and informs each layer 
of a scale band corresponding thereto when a frequency domain is divided into a 
plurality of bands and each of the bands is assigned a proper scale factor. As a 
result, each layer belongs to at least one scale band. Each scale band is assigned 
one scale factor. The coding band information is necessary for properly carrying 
out encoding depending on the frequency characteristics of an audio signal and 
informs each layer of an encoding band corresponding thereto when a frequency 
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domain is divided into a plurality of bands and each of the bands is assigned a 
proper coding model. The scale bands and the encoding bands are properly 
divided by tests, and then scale factors and coding models corresponding thereto 
are determined. 

5 The multiplexer 3 multiplexes the bandwidth-limited audio data and the BWE 

information in such an order that data of the encoded quantization samples 
corresponding to the base layer is located, BWE information is located, and data of 
the encoded quantization samples corresponding to the remaining enhancement 
layers is located or in such an order that BWE information is located, data of the 
10 encoded quantization samples corresponding to the base layer is located, and data 
of the encoded quantization samples corresponding to the remaining enhancement 
layers is located. 

FIG. 4 is a block diagram of a decoding apparatus according to the present 
invention. Referring to FIG. 4, the decoding apparatus, which decodes a bitstream 

15 and then outputs digital data, includes a demultiplexer 7, a FGS decoder 8, and a 
BWE decoder 9. 

The demultiplexer 7 demultiplexes an input bitstream to sample 
bandwidth-limited data, which has been encoded into a hierarchical structure having 
a base layer and at least one enhancement layer, and BWE information therefrom. 

20 Here, the bandwidth-limited data and the BWE information is the same as that 

described with reference to FIG. 1 . The FGS decoder 8 decodes at least a portion 
of the bandwidth-limited data sampled by the demultiplexer 7 corresponding to the 
base layer. The layer on which decoding is performed depends on the state of a 
network, a user's selection, or the like. Based on the portion of the 

25 bandwidth-limited data decoded by the FGS decoder 8 and with reference to the 
BWE information sampled by the demultiplexer 7, the BWE decoder 9 generates 
digital data in at least a portion of a band that is not covered by the bandwidth-limited 
data decoded by the FGS decoder 8 and then patches the generated digital data to 
the bandwidth-limited data decoded by the FGS decoder 8. Even if the band-limited 

30 data decoded by the FGS decoder 8 is only base band data, the BWE decoder 9 

creates missing band data and patches the missing band data to the base band data. 
As a result, quality of the decoded portion of the bandwidth-limited data can be 
improved. 
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FIG. 5 is a block diagram of a decoding apparatus according to an 
embodiment of the present invention. Referring to FIG. 5, the decoding apparatus, 
which receives and decodes an audio bitstream, and then outputs audio data, 
includes a demultiplexer 7, a FGS decoder 8, and a BWE decoder 9. Compared to 
5 the decoding apparatus shown in FIG. 4, the decoding apparatus shown in FIG. 5 is 
characterized by processing audio data. Therefore, blocks carrying out the same 
functions as those of FIG. 4 are denoted by the same reference numerals, and thus 
repeated descriptions will be omitted. 

The demultiplexer 7 demultiplexes an input audio bitstream to sample 

10 bandwidth-limited audio data, which has been encoded into a hierarchical structure 
having a base layer and at least one enhancement layer, and BWE information 
therefrom. The FGS decoder 8 decodes at least a portion of the bandwidth-limited 
audio data corresponding to the base layer. Based on the portion of the 
bandwidth-limited audio data decoded by the FGS decoder 8 and with reference to 

15 the BWE information sampled by the demultiplexer 7, the BWE decoder 9 generates 
audio data in at least a portion of a band that is not covered by the portion of 
bandwidth-limited audio data decoded by the FGS decoder 8 and then patches the 
generated audio data to the portion of the bandwidth-limited audio data decoded by 
the FGS decoder 8. 

20 FIG. 6 illustrates an example of the realization of the decoding apparatus 

shown in FIG. 5. Referring to FIG. 6, the decoding apparatus includes a 
demultiplexer 7, a FGS decoder 8, and a BWE decoder 9. Blocks carrying out the 
same functions as those of FIG. 5 are denoted by the same reference numerals, and 
thus repeated descriptions will be omitted. 

25 In particular, the FGS decoder 8 performs decoding up to a target layer that is 

determined depending on the state of a network, the performance of the decoding 
apparatus, a user's selection, and so forth in order to control a bit rate. The FGS 
decoder 8 includes a FGS decoding unit 81, a dequantizing unit 82, and an 
inverse-transforming unit 83. The FGS decoding unit 81 performs decoding up to a 

30 target layer of an audio bitstream. In more detail, the FGS decoding unit 81 

lossless-decodes encoded quantization samples corresponding to each layer based 
on coding model information obtained by decoding side information containing scale 
factor information and coding model information corresponding to each layer in order 
to obtain quantization samples. 
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Scale band information and coding band information may be obtained from 
header information of the audio bitstream or may be obtained by decoding side 
information of each layer. Alternatively, the decoding apparatus may store scale 
band information and coding band information in advance. The dequantizing unit 
5 82 dequantizes and reconstructs quantization samples of each layer based on scale 
factor information corresponding to each layer. The inverse-transforming unit 83 
frequency/time-maps the reconstructed samples, transforms the mapped samples 
into time domain PCM audio data, and outputs the time domain PCM audio data. 
The BWE decoder 9 includes a transforming unit 91, a high frequency 

10 generating unit 92, an adjusting unit 93, and a synthesizing unit 94. The 

transforming unit 91 transforms the time domain PCM audio data output from the 
inverse-transforming unit 83 into frequency domain data. The frequency domain 
data is referred to as a low frequency portion. The high frequency generating unit 
92 generates a portion that is not covered by the frequency domain data, i.e., a high 

15 frequency portion by replicating the low frequency portion with reference to BWE 

information and then patching the replicated low frequency portion to the frequency 
domain data, i.e., the original low frequency portion. The adjusting unit 93 adjusts 
the level of the high frequency portion generated by the high frequency generating 
unit 92 using envelope information contained in the BWE information. The 

20 envelope information, which is transmitted from an encoding node, represents 

envelope information of audio data corresponding to a high frequency portion that is 
sliced by the encoding node during BWE encoding. The synthesizing unit 94 
synthesizes the low frequency portion output from the transforming unit 91 and the 
high frequency portion output from the adjusting unit 93 and then outputs PCM audio 

25 data. As described above, although the FGS decoder 8 decodes only base band 
audio data, the BWE decoder 9 reconstructs missing band audio data and then 
patches the missing band audio data to the base band audio data. As a result, the 
quality of the base band audio data can be improved. 

FIG. 7 illustrates the structure of a bitstream output from the FGS encoder 2. 

30 Referring to FIG. 7, the frame of a bitstream is encoded by the FGS encoder 2 by 
mapping quantization samples and side information into a hierarchical structure for 
fine grain scalability (FGS). In other words, the frame has a hierarchical structure in 
which a bitstream of a lower layer is included in a bitstream of an enhancement layer. 
Side information necessary for each layer is encoded on a layer-by-layer basis. 
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A header area in which header information is stored is located in the starting 
part of a bitstream, information of a zero th layer is packed, and information of first 
through N m layers that are enhancement layers is sequentially packed. A base 
layer ranges from the header area to the information of the zero th layer, a first layer 
5 ranges from the header area to the information of the first layer, and a second layer 
ranges from the header area to the information of the second layer In the same 
manner, the most enhancement layer ranges from the header area to the information 
of the N m layer, i.e., from the base layer to the N th layer. Side information and 
encoded data is stored as information of each layer. For example, side information 

10 2 and encoded quantization samples are stored as the information of the second 
layer. Here, N is an integer that is greater than or equal to "1". 

FIG. 8 illustrates the detailed structure of the side information shown in FIG. 7. 
Referring to FIG. 8, side information and encoded quantization samples are stored 
as information of an arbitrary layer. In the present embodiment, if Huffman 

15 encoding is performed as lossless-encoding, side information contains Huffman 

coding model information, quantization factor information, channel side information, 
and other side information. Huffman coding model information refers to index 
information of a Huffman coding model to be used for encoding or decoding 
quanitzation samples contained in a corresponding layer. The quantization factor 

20 information informs a corresponding layer of the size of a quantizing step suitable for 
quantizing or dequantizing audio data contained in the corresponding layer. The 
channel side information refers to information on a channel such as middle/side 
(M/S) stereo. The other side information is flag information indicating whether the 
M/S stereo is used. 

25 FIG. 9 illustrates the structure of a bitstream output from the multiplexer 3. 

Referring to FIG. 9, a zero th layer, which is a base layer encoded by the FGS 
encoder 2, is located in the starting part of the bitstream, BWE information is located 
after the zero th layer, and enhancement layers, i.e., a first layer, a second layer, 
and an N th layer, are located after the BWE information. Although a decoding node 

30 receives or decodes only the information of the base layer, the decoding node can 
create missing layer information based on the decoded data of the base layer and 
with reference to the BWE information. 

FIG. 10 is a view for explaining BWE decoding performed by the BWE 
decoder 9 in detail. Referring to FIG. 10, a striped portion denotes data decoded by 
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the FGS decoder 8 and a dotted portion denotes data created by the BWE decoder 9. 
When all data within a quarter portion of a sampling frequency Fs belongs to a base 
layer, FIG. 10 (a) illustrates a case where only base band data is decoded by a 
decoding node, and FIGS. 10 (b), (c), and (d) illustrate a case where data 
corresponding to the base layer and at least one enhancement layer are decoded by 
the FGS decoder 8. In other words, the FGS decoder 8 is able to decode data so 
as to control a bit rate, and the BWE decoder 9 is able to create missing band data 
that is not decoded by the FGS decoder 8. 

Encoding and decoding methods according to a preferred embodiment of the 
present invention will be described based on the above-described structure. 

FIG. 1 1 is a flowchart for explaining an encoding method according to the 
present invention. Referring to FIG. 1 1 , in step 1 101 , an encoding apparatus 
BWE-encodes digital data, outputs bandwidth-limited data, and generates BWE 
information. In step 1 102, the coding apparatus encodes the bandwidth-limited 
data into a hierarchical structure having a base layer and at least one enhancement 
layer so as to control a bit rate. Here, the encoding apparatus encodes side 
information corresponding to the base layer, bit-sliced-encodes a plurality of 
quantization samples corresponding to the base layer, and encodes side information 
and quantization samples corresponding to a next enhancement layer until a plurality 
of predetermined layers are completely encoded. In step 1 103, the encoding 
apparatus multiplexes the encoded bandwidth-limited data and the BWE information 
and then outputs a bitstream. Here, the encoding apparatus multiplexes the 
encoded bandwidth-limited data and the BWE information in such an order that a 
portion of the encoded bandwidth-limited data corresponding to the base layer is 
located, the BWE information is located, portions of the bandwidth-limited data 
corresponding to the remaining enhancement layers are located or in such an order 
that the BWE information is located, the portion of the encoded bandwidth-limited 
data corresponding to the base layer is located, and the portions of the encoded 
bandwidth-limited data corresponding to the remaining enhancement layers are 
located. 

FIG. 12 is a flowchart for explaining an encoding method according to an 
embodiment of the present invention. Referring to FIG. 12, in step 1201, an 
encoding apparatus BWE-encodes audio data, outputs bandwidth-limited audio data, 
and generates BWE information corresponding to a base layer. The BWE 
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information of the base layer is necessary for generating missing band audio data 
based on audio data corresponding to the base layer using a decoding node. The 
encoding apparatus encodes the bandwidth-limited audio data into a hierarchical 
structure having a base layer and at least one enhancement layer. In more detail, 
5 the encoding apparatus transforms audio data corresponding to each layer into 
bandwidth-limited audio data on a layer-by-layer basis in step 1202, quantizes the 
bandwidth-limited audio data in step 1203, and lossless-encodes the quantized 
audio data, and packages the lossless-encoded audio data into a hierarchical 
structure so as to a bit rate. In step 1205, the encoding apparatus multiplexes the 

10 encoded bandwidth-limited audio data and the BWE information and then outputs a 
bitstream. In more detail, the encoding apparatus multiplexes the encoded 
bandwidth-limited data and the BWE information in such an order than a portion of 
the encoded bandwidth-limited data corresponding to the base layer is located, the 
BWE information is located, portions of the encoded bandwidth-limited data 

15 corresponding to the remaining enhancement layers are located or in such an order 
that the BWE information is located, the portion of the encoded bandwidth-limited 
data corresponding to the base layer is located, and the portions of the encoded 
bandwidth-limited data corresponding to the remaining enhancement layers are 
located. 

20 FIG. 13 is a flowchart for explaining a decoding method according to the 

present invention. Referring to FIG. 13, in step 1301, a decoding apparatus 
demultiplexes an input bitstream and samples bandwidth-limited data, which has 
been encoded into a hierarchical structure having a base layer and at least one 
enhancement layer, and BWE information. In other words, the decoding apparatus 

25 demultiplexes the input bitstream in such an order that it samples data 

corresponding to the base layer, BWE information, and data corresponding to the 
remaining enhancement layers from the input bitstream or samples the BWE 
information, the data corresponding to the base layer, and the data corresponding to 
the remaining enhancement layers from the input bitstream. In step 1302, the 

30 decoding apparatus decodes at least a portion of bandwidth-limited data 

corresponding to the base layer. In more detail, the decoding apparatus decodes 
side information corresponding to the base layer, bit-sliced-decodes a plurality of 
quantization samples corresponding to the base layer, and decodes side information 
and a plurality of quantization samples corresponding to a next enhancement layer 



until a plurality of predetermined layers are completely decoded. In step 1303, the 
decoding apparatus generates digital data in at least a portion of a band that is not 
covered by the portion of the bandwidth-limited data decoded in step 1302, based on 
the portion of the bandwidth-limited data decoded in step 1302 and with reference to 
5 the BWE information, and then patches the generated digital data to the decoded 
portion of the bandwidth-limited data. 

FIG. 14 is a flowchart for explaining a decoding method according to an 
embodiment of the present invention. Referring to FIG. 14, in step 1401, a 
decoding apparatus demultiplexes an input audio bitstream and then samples 

10 bandwidth-limited audio data, which has been encoded into a hierarchical structure 
having a base layer and at least one enhancement layer, and BWE information. In 
other words, the decoding apparatus demultiplexes the input audio bitstream in such 
an order that it samples data corresponding to the base layer, BWE information, and 
data corresponding to the remaining enhancement layers from the input audio 

15 bitstream or in such an order that it samples the BWE information, the data 

corresponding to the base layer, and the data corresponding to the remaining 
enhancement layers from the input audio bitstream. The decoding apparatus 
decodes at least a portion of the bandwidth-limited audio data corresponding to the 
base layer so as to control a bit rate. In more detail, the decoding apparatus 

20 performs lossless-decoding up to a target layer in step 1402, performs dequantizaing 
in step 1403, and performs inverse-transforming in step 1404. In step 1405, the 
decoding apparatus generates audio data in at least a portion of a band that is not 
covered by the portion of the bandwidth-limited audio data obtained in step 1404, 
based on the portion of the bandwidth-limited audio data obtained in step 1404 and 

25 with reference to the BWE information. 

As described above, the present invention can provide a bit rate scalable 
encoding and decoding method and apparatus by which high quality sound can be 
provided by restoring only a portion of a bitstream. 

While the present invention has been particularly shown and described with 

30 reference to exemplary embodiments thereof, it will be understood by those of 

ordinary skill in the art that various changes in form and details may be made therein 
without departing from the spirit and scope of the present invention as defined by the 
following claims. 
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